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PROGRAMMABLE VIDEO PROCESSING AND VIDEO STORAGE 

ARCHITECTURE 

CROSS-REFERENCE TO RELATED APPLICATIONS 
The present patent application is related to U.S. Patent Application Serial No. 
10/600,047 (Client Docket No.: P15961, Attorney Docket No.: 884.898US1), to Louis A. 
Lippincott, entitled "Processor to Processor Communication in a Data Driven 
Architecture", filed on June 19, 2003, which is assigned to the assignee of the present 
patent application. The present patent application is also related to U.S. Patent 
Application Serial No. 10/600,048 (Client Docket No.: PI 5962, Attorney Docket No.: 
884.899US1), to Louis A. Lippincott, entitled "Communication Ports in a Data Driven 
Architecture", filed on June 19, 2003, which is assigned to the assignee of flie present 
patent application. 

TECHNICAL FIELD 

This mvention relates generally to electronic data processing and more particularly, 
to a programmable video processing and video storage architecture. 

BACKGROUND 

A number of different electronic devices have been developed to assist viewers in 
recording and viewing of video. One such device that is increasing in demand is the 
digital video recorder that allows the user to pause live television, rewind, etc. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Embodiments of the invention may be best understood by referring to the 
following description and accompanying drawings which illustrate such embodiments. 
The numbering scheme for the Figures included herein are such that the leading ntxmber 
for a given reference number in a Figure is associated with the number of the Figure. For 
example, a system 100 can be located in Figure 1. However, reference numbers are the 
same for those elements that are the same across different Figures. In the drawings: 
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Figure 1 illustrates a simplified block diagram of a system configuration that 
includes a programmable video processing and video storage architecture, according to 
one embodiment of the invention. 

Figure 2 illustrates a more detailed block diagram of a programmable processing 
5 unit, according to one embodiment of the invention. 

Figure 3 illustrates a more detailed block diagram of programmable processor, 
according to one embodiment of the invention. 

Figure 4 illustrates a port ring and associated ports of a programmable processor, 
according to one embodiment of the invention. 
10 Figure 5 illustrates a FIFO memory within a transmitter or receiver port and 

associated interface signals for the memory, according to one embodiment of the 
invention. 

Figure 6 illustrates a more detailed block diagram of a receiver port along with 
associated mterface signals, according to one embodiment of the invention. 
15 Figures 7A-7G illustrate a more detailed block diagram of a receiver port 

communicating with different transmitter ports, accordmg to one embodiment of the 
invention. 

Figure 8 illustrates a more detailed block diagram of a transmitter port along with 
associated interface signals, according to one embodiment of the invention. 
20 Figures 9A-9E illustrate a more detailed block diagram of a transmitter port 

communicating with different receiver ports that using different interface signals, 
according to one embodiment of the invention. 

Figure 10 illustrates a number of different routes for a given logical coimection 
from a source programmable processor to a destination programmable processor, 
25 according to one embodiment of the invention. 

Figure 11 illustrates a flow diagram for establishing and initializing of a logical 
coimection within a programmable processing unit, according to one embodiment of the 
invention. 

Figure 12 illustrates a flow diagram for processing of data by a programmable 
30 processing unit, according to one embodiment of the invention. 

Figures 13A-13B illustrate flow diagrams for communications among memories 
of different ports in a programmable processor, according to embodiments of the 
invention. 

2 
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F^re 14 illustrates a flow diagram for multiple mode operations of a 
programmable processing unit, according to one embodiment of the invention. 

DKTAn.ED DESCRIPTION 

5 Methods, apparatus and systems for a programmable video processing and video 

storage architecture are described In the following description, numerous specific details 
are set forth. However, it is imderstood that embodiments of the invention may be 
practiced without these specific details. In other instances, well-known circuits, structures 
and techniques have not been shown in detail in order not to obscure the understanding of 

10 this description. While described with reference to processing and storage of video, 
embodiments of the invention are not so limited. For example, embodiments of the 
mvention may include the processing and storage of audio, metadata that is part of the 
video and/or audio (e.g., close captioning, etc.), etc. 

This detailed description is divided into two sections. In the first section, a 

15 hardware and operating environment are presented. In the second section, the operations 
of a programmable processing unit of a programmable video processing and video storage 
architecture are presented. Additionally, the first and second sections have been divided. 
The first section is divided into a description of a system, a description of a programmable 
processing unit, a description of a programmable processor, a description of a port ring 

20 and ports of a programmable processor and a description of the logical connections among 
the programmable processors. The second section is divided into a description of data 
transmission operations among the programmable processors and a description of multiple 
mode operations of the programmable processing unit 

25 Hardware and Operating Environment 

In this section, a hardware and operating environment are presented. 

System Description 

Figure 1 illustrates a simplified block diagram of a system configuration that 

30 includes a programmable video processing and video storage architecture, according to 

one embodiment of the invention. As shown, Figure 1 illustrates a system 100 that 

includes a video tuner 102, a video signal processor 104, a programmable processing unit 

106, a memory 108, a video display processor 110 and a video display 1 12. In one 

3 
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embodiment, the system 100 can include components that may be within any of a nwnber 
of different viewing devices, such as different types of televisions, con^uter monitors, etc. 
In another embodiment, parts of the components may be part of a set-top box, media 
center, etc. For example, the video tuner 102, the video signal processor 104, tiie 
programmable processing unit 106 and the memory 108 may be within a set-top box, 
while the video display processor 1 10 and the video display 1 12 are within a display 
device (e.g., a television). In an embodiment, the memory 108 can be different types of 
random access memory (RAM). For example, the memory 108 can be a Synchronous 
Dynamic RAM (SDRAM), DRAM, a double data rate (DDR) Synchronous Dynamic 
RAM (SDRAM), etc. 

The video tuner 102 is coupled to Ihe video signal processor 104. The video signal 
processor 104 is coi^led to the programmable processmg unit 106. The programmable 
processing unit 106 is coi5)led to the memory 108 and the video display processor 1 10. 
The video display processor 110 is coupled to flie video display 1 12. The video tuner 102 
is coupled to receive a signal 1 14. In an embodiment, the signal 1 14 is a Radio Frequency 
(RF) signal. The video tuner 102 receives the signal 1 14 and converts the signal 1 14 into 
an analog video signal 1 16 based on flie channel tiiat is currently selected for viewing on 
the video display 112. 

The video signal processor 104 receives tiie analog video signal 1 16 and converts 
the analog video signal 1 16 mto a digital video signal 1 18. In one embodiment, the video 
signal processor 104 filters the analog video signal 1 16 prior to conversion to the digital 
video signal 1 18. For example, the video signal processor 104 may filter the analog video 
signal 1 16 to remove noise from adjacent channels. In one embodhnent, the video signal 
processor 104 filters the analog video signal 1 16 to remove noise from the audio signal. 
The digital video signal 118 may be a stream of data that can include sync signals for 
display on the video display 1 12, 

The programmable processing unit 106 receives the digital video signal 1 18. As 
will be further described below, in an embodiment, the programmable processing unit 106 
performs different operations depending on the current mode of the system 100. In one 
such embodiment, the programmable processing unit 106 operates in one of two modes: 1) 
a storage mode and 2) an unage processing mode. In the storage mode, the user of the 
system 100 may be saving or replaying video that has been received mto the system 100. 

In an embodiment, ttie programmable processmg unit 106 stores tiie video into the 

4 
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memory 108. To fllustrate, assume that the system 100 is part of a television. The user 
may save video being received on the signal 1 14. For exanq>le, the ns^r may pause the 
video being displayed on the video display 1 12 for a given period of time. Additionally, 
in the storage mode, the user may replay the video that is displayed on the video display 

5 1 12, thereby causmg storage of the incoming video to be stored into the memory 108. 

In an embodiment, if the user is not placing the programmable processing unit 106 
into the storage mode, the programmable processing unit 106 is in the image processing 
mode. In one embodiment, the programming processing unit 106 remains in the storage 
mode until the memory 108 does not include stored video for display. The image 

10 processing mode can be enabled when a user is not attempting to replay video that is saved 
within the memory 108 or when a user is not attempting to save video into the memory 
108 that is being received into the system 100. As fiuther described below, in one such 
embodiment, the programmable processing unit 106 performs enhanced video processing 
on the incoming video signal. Such enhancement may include reduction of ghosting, 

15 noise, dot-crawl, etc. The programmable processmg unit 106 may use the memory 108 to 
store multiple frames of the incoming video. Accordingly, the programmable processing 
unit 106 may perform enhancement of the image across a number of frames. For exanq)le, 
the programmable processing unit 106 may perform temporal noise reduction across a 
number of frames. The programmable processing unit 106 and the operations therein are 

20 described in more detail below. 

The video display processor 110 receives the video from the programmable 
processing unit 106 and may perform a number of operations on the video for display on 
the video display 1 12. For example, the video display processor 1 10 may adjust the video 
based on tint settings, contrast settings, etc. The video display 1 12 is coupled to receive 

25 the signal from the video display processor 110. While the video display 1 12 may be a 
number of different types of displays, in one embodiment, the video display 1 12 may be a 
cathode ray tube (CRT), a plasma display, a liquid crystal display (LCD), etc. 

Accordingly, embodiments of the invention include a flexible programmable 
architecture to provide alternative functionality based on the mode of operation. Such 

30 functionality includes operations of a digital video recorder and operations that allow for 

enhanced image processing of the video. Therefore, display devices that incorporate 

embodiments of the invention may offer the ability to rewind, pause, fast forward and play 

for a given channel of as well as improved image quality of the broadcasted content. 

5 
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Description of a Programmable Processing Unit 

One embodiment of the programmable processing unit 106 is now described- In 
particular, Figure 2 illustrates a more detailed block diagram of a programmable 
processing unit, according to one embodiment of the invention. As shown, the 
programmable processing unit 106 includes programmable processors 202A-202H. The 
programmable processors 202A-202H include port rings 250A-250H, respectively. As 
further described below, the port rings 250A-250H include a number of ports through 
which the programmable processors 202A-202H transmit interface (control and data) 
signals. In an embodiment, a given port ring 250 includes eight I/O ports, wherein each 
such I/O port is a bi-directional connection such that data can be sent and received 
simultaneously through two separate unidirectional data buses. In other words, an I/O port 
includes a transmitter port and a receiver port 

The programmable processing unit 106 also includes a Durect Memory Access 
(DMA) unit 204, a DMA unit 210, a memory interface 206, an input/output (I/O) interface 
208, and a host processor 214. While Figure 2 illustrates eight programmable processors 
202, embodiments are not so limited, as a greater and/or a lesser number of such elements 
may be incorporated into embodiments of the progranmiable processmg unit 106. In one 
embodiment, the host processor 214 may be different types of general purpose processors. 
The I/O interface 208 provides an interface to I/O devices or peripheral components for 
the programmable processing unit 106. The I/O mterface 208 may comprise any suitable 
interface controllers to provide for any suitable communication link to dififerent 
components of the programmable processing unit 106. The I/O interface 208 for one 
embodiment provides suitable arbitration and buffering for one of a number of interfaces. 

As shown, the interconnections among the programmable processors 202A-202H 
provides for a point-to-point nearest neighbor configuration, wherein a given 
programmable processor 202 is physically connected to four otiier elements (e.g., a 
different progranmiable processor 202, the DMA unit 204) witiiin the programmable 
processing unit 106. In other words, a given programmable processor 204 is not 
physically connected to every other programmable processor 204 within the 
programmable processing unit 106. As further described below, data may be transmitted 
from a source programmable processor 202 to a destination programmable processor 202 

through a series of intermediate programmable processors 202. In an embodiment, the 

6 
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transmission through tiie series of intermediate programmable processors 202 is such that 
the data is received on a receiver port of tiie intermediate programmable processor 202 and 
is ou^utted on a transmitter port of the intermediate programmable processor 202 through 
the port ring 250. Accordingly, no processor elements within the intermediate 
5 programmable processor 202 perform a process operation on the data as part of the 
transmission of the data from the source to the destination programmable processor 202. 

Through the port ring 250A, the programmable processor 202A is coupled to the 
I/O interface 208 through two I/O ports and is coupled to the port ring 250D of the 
programmable processor 202D through a different I/O port. Through the port ring 250A, 
10 the programmable processor 202A is coupled to the DMA unit 204 through two other 

different I/O ports. Thiou^ the port ring 250A, the programmable processor 202A is also 
coiq)led to tiie port ring 250B of the programmable processor 202B through two more 
different I/O ports. Furdier, through the port ring 250A, flie programmable processor 
202A is coiq)led to the port ring 250E of the programmable processor 202E through two 
15 other I/O ports. 

Throu^ the port ring 250B, the programmable processor 202B is coi^)led to flie 
DMA unit 204 through two different I/O ports. Through the port ring 250B, the 
programmable processor 202B is also coiq)led to the port ring 250C of the programmable 
processor 202C through two other different I/O ports. Through the port ring 250B, the 
20 programmable processor 202B is coupled to the port ring 250F of the programmable 
processor 202F through two more different I/O ports. 

Through the port ring 250C, the programmable processor 202C is coi^led to the 
DMA unit 204 through two different I/O ports. Through the port ring 250C, the 
programmable processor 202C is also coupled to the port ring 250D of the programmable 
25 processor 202D through two other different I/O ports. Through the port ring 250C, the 
programmable processor 202C is coupled to the port ring 250G of the programmable 
processor 202G through two more different I/O ports. 

Through tiie port ring 250D, the programmable processor 202D is coupled to the 
DMA unit 204 through two different I/O ports. Through the port ring 250D, the 
30 programmable processor 202D is also coupled to the DMA unit 210 through two I/O ports 
and is co\5)led to the port ring 250A of the programmable processor 202A through a 
different I/O port. Through the port ring 250D, Ae programmable processor 202D is 



7 
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coiq)led to tiie port ring 2S0H of the programmable processor 202H through two more 
different I/O ports. 

Through the port ring 250E, the programmable processor 202E is coupled to flie 
I/O interface 208 through two I/O port and is coupled to the port ring 250H of the 
5 programmable processor 202H through a different I/O port. Through the port ring 250E, 
the programmable processor 202E is also coupled to the port ring 250F of the 
programmable processor 202F through two more different I/O ports. Further, through the 
port ring 250E, the programmable processor 202E is coupled to the port ring 250A of the 
programmable processor 202A through two other I/O ports. 
10 Through the port ring 250F, the programmable processor 202F is coupled to the 

port ring 250G of the programmable processor 202G through two other different I/O ports. 
Through the port ring 2S0F, the programmable processor 202F is coupled to the port ring 
2S0B of the programmable processor 202B trough two more different I/O ports. 

Through the port ring 2S0G, the programmable processor 202G is coupled to the 
15 port ring 2S0H of the programmable processor 202H through two different I/O ports. 

Through the port ring 2500^ the progranunable processor 202G is coupled to the port ring 
2S0C of the programmable processor 202C through two more different I/O ports. 

Througji the port ring 250H, the programmable processor 202H is also coupled to 
the DMA unit 212 through two I/O ports and is coupled to the port ring 2S0E of the 
20 programmable processor 202E througjh a different I/O port. Througji the port ring 250H, 
the programmable processor 202H is coupled to tiie port ring 250D of flie programmable 
processor 202D through two more different I/O ports. 

The I/O interface 208 may also be externally coupled to different external devices. 
In an embodiment, the I/O interface 208 is externally coupled to receive the digital video 
25 signal 118 from the video signal processor 104 and is externally coupled to transmit the 
digital video signal 1 18 to the video display processor 1 10. In one embodiment, the I/O 
interface 208 may be externally coupled to other programmable processing units 106, 
thereby allowing for the expansion of the number of programmable processors 202 that 
can commimicate and process image data together. In one embodiment, a number of the 
30 programmable processing unitsl06 may be daisy-chained together to allow for the 
processing of data across a number of different programmable processing units 106. 

The DMA unit 204 is coupled to the memory inter&ce 206. The memory interface 
206 is coupled to the memory 108. As will be described m more detail below, data (such 
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as output fix)m a result of a process operation fiom one of the imgramxnable processors 
202) can be stored into and read from the memory 108 through the DMA unit 204 and the 
memory interface 206. 

The host processor 214 is coupled to an internal global bus 216. Although not 
5 shown in Figure 2, the internal global bxis 216 is coupled to the different elements within 
the programmable processing unit 106. Accordingly, the host processor 214 may directly 
conunijnicate with/configure each of the different elements within the prograimnable 
processing unit 106. 

1 0 Description of a Programmable Processor 

Figure 3 illustrates a more detailed block diagram of programmable processor, 
according to one embodiment of the invention. In particular, Figure 3 illustrates a more 
detailed block digram of one of the programmable processors 202, according to one 
embodiment of the invention. 

1 5 The programmable processor 202 includes an input processor element 302, an 

output processor element 304, a numb^ of processor elements 306A-306C, a number of 
registers 308, a number of accelerator units 310A-31OB, a memory 314 and a memory 
controller 316. The input processor element 302, the output processor element 304, the 
processor elements 306A-306C, the accelerator units 310A-310B and the memory 314 

20 (through the memory controller 3 1 6) are coi^led to the registers 308. The registers 308 
allow the processor elements 302, 304 and 306, the accelerator units 310A-310B and the 
memory 314 to exchange data and can be used as general purpose registers for a given 
processor element 302, 304 and 306 and the accelerator units 3 10A-310B. Moreover, the 
processor elements 302, 304 and 306 and the accelerator units 310A-3 lOB may include a 

25 number of local registers (not shown). 

In an embodiment, the input processor element 302, the output processor element 
304 and the processor elements 306A-306C include an instruction memory and an 
arithmetic-logic unit (ALU) for processing of the data. The input processor element 302 
and the output processor element 304 are coupled to the ports of the programmable 

30 processor 202 through the port ring 250 to receive data being inputted into and to transmit 
data being outputted from, respectively, the programmable processor 202 (which is 
described m more detail below in conjunction wifli Figures 4-9). In addition to inputting 
and outputting of data, the input processor element 302 and/or the output processor 
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element 304 may process the data (similar to the processing provided by the processor 
elements 306A-306C). The different processor elements 306A-306C may be general 
purpose processor elements or special pxupose processor elements. For example, the 
processor elements 306A-306C may be Multiply-Accumulate (MAC) processor elements 
5 that include an instruction set for general purpose processing as well as an instruction set 
for MAC functionality. The processor elements 306A-306C may be a combination of 
general purpose processor elements and special purpose processor elements. For example, 
the processor elements 306A and 306C may be MAC processor elements, while the 
processor elements 306B may be a general purpose processor element. While Figure 3 

10 illustrates five processor elements within the programmable processor 202, in other 

embodiments, a lesser or greater number of such processor elements may be incorporated 
into the programmable processor 202. 

The input processor element 302 is a general purpose processor element with a port 
interface as an input port. In an embodiment, the instructions within the input processor 

15 element 302 have the ports as additional input operands along with the registers 308 and 
the local registers within the input processor element 302. The output processor element 
304 is a general purpose processor element with a port interface as an output port. In an 
embodiment, the instructions within the output processor element 304 have the ports as 
additional ou^ut operands along with the registers 308 and the local registers within the 

20 output processor element 304. 

Port Ring and Ports of a Programmable Processor 

Figure 4 illustrates a port ring and associated ports of a programmable processor, 
according to one embodiment of the invention. The programmable processor 202 is 

25 coupled to input and output data to and from ports 404A-404H through the port ring 250. 
As shown, in an embodiment, the ports 404A-404H are bi-directional data coimections 
that allow for data to flow from one programmable processor 202 to a different unit (such 
as a different programmable processor 202, the DMA unit 204, the I/O interface 208). 
A given port 404A-404H comprises a receiver port and a transmitter port for 

30 receiving data into and transmitting data out from the port 404, respectively. In particular, 
the ports 404A-404H include receiver ports 406A-406H and transmitter ports 408A-408H, 
respectively. An embodiment of a receiver port and an embodiment of a transmitter port 
are described below in conjunction with Figure 6 and Figure 7, respectively. In an 
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embodiment, the programmable ptocessor 202 is comiected to adjacent (nearest neighbor) 
programmable processors 202 (as illustrated m Figure 2) through the ports 404A-404H. 

One embodiment of receiver and transmitter port (within one of the ports 404) 
which includes FIFO memories will now be described. Figure 5 illustrates a FIFO 
5 memory within a transmitter or receiver port and associated interface signals for the 
memory, according to one embodiment of the invention. 

As shown, a FIFO memory 500 receives an initjn signal 506 and transmits an 
init_out signal 510, which (as described in more detail below) are control signals for 
initialization and generation of a logical connection that is used to transmit data through 
10 the different progranunable processors 202. The FIFO memory 500 receives a data_in 
signal 502 that inputs data into one of the entries of the FIFO memory 500. 

The FIFO memory 500 also illustrates a number of grant/request signals. As is 
further described below, in an embodiment, the ports 404 use a handshake protocol for the 
transmission of data based on these grant/request signals. Accordingly, this grant/receive 
15 protocol allows for a data driven architecture, wherein the image process operations are 
driven by the data on which such operations execute. 

The FIFO memory 500 receives a requestjn signal 504, which is a control signal 
from a FIFO memory in a different port that mputs data mto an entry of the FIFO memory 
500. The FIFO memory 500 transmits a grant_in signal 508 to this FIFO memory, in 
20 response to the request_in signal 504, that indicates that this FIFO memory may transmit 
data into the FIFO memory 500. 

The FIFO memory 500 transmits a request^out signal 512 to a FIFO memory of a 
different port to request the transmission of data from the FIFO memory 500 to this FIFO 
memory. The FIFO memory 500 receives a grant_out signal 516 from this FIFO memory, 
25 in response to the request_out signal 512, This grant_out signal 516 signals to the FIFO 
memory 500 that the different FIFO memory will receive data from the FIFO memory 
500. The FIFO memory 500 transmits a data_out signal 510 that transmits data to the 
different FIFO memory that granted transmission of the data (through grant_out signal 
516) in response to the request_out signal 512. 
30 Figure 6 illustrates a more detailed block diagram of a receiver port along with 

associated interface signals, according to one embodiment of the invention. In particular. 
Figure 6 illustrates one embodiment of a receiver port 406 (that includes a receiver FIFO 
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604) and associated interface signals. The receiver port 406 is within one of the ports 404 
(shown in Figure 4) and receives data into the programmable processor 202. 

The receiver FIFO 604 is coupled to receive and transmit interfece signals (the 
grant_in signal 508, the data_in signal 502, the request Jn signal 504 and the init_in signal 
5 506) to and from a transmitter port 408 that is external to the port ring 250 of the 
programmable processor 202. The receiver FIFO 604 is also coupled to receive and 
transmit interface signals (a number of grant_out signals 516A-516N, the data_out signal 
510, the request_out signal 512 and the init_out signal 514) from transmitter ports 408 that 
are internal to the port ring 250 of the programmable processor 202 or a processor element 

10 within the programmable processor 202. As shown, the grant_out signals 516A-516N are 
received into a multiplexer 602. The receiver port 406 uses a select signal 606 to cause 
the multiplexer 602 to select one of the grant_out signals 516A-516N to be inputted into 
the receiver FIFO 604. As described above, the host processor 214 configures the 
programmable processors 1202, wherein ou^ut from one processor element in a 

1 5 programmable processor 202 may be input to be processed by a different processor 
element in a different progranunable processor 202 through a logical coimection. 
Accordingly, the host processor 214 causes the receiver port 406 to assert the select signal 
606 to select the grant_out signal 516 from the appropriate transmitter port 408/input 
processor element 302. 

20 As described, the output from a first process operation in a first programmable 

processor 202 may be forwarded to a second programmable processor 202, wherem a 
second process operation is performed. In one embodiment, this output is transmitted 
through a logical coimection that comprises a nxmiber of ports 404 of a number of 
programmable processors 202. In an embodiment, an initialize signal is transmitted 

25 through the different ports 404 through which the data is transmitted for a given logical 
coimection. As described above, the architecture of the programmable processing unit 106 
is such that a given programmable processor 202 is not directly connected to every other 
programmable processor 202. Rather, a programmable processor 202 is connected to 
adjacent (nearest neighbor) devices. Therefore, if data is to be transmitted from one 

30 programmable processor 202 to another programmable processor 202, a logical connection 
is established through different ports of the different programmable processors 202 that 
the data traverses &om the source programmable processor 202 to the destination 
programmable processor 202. 
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Returning to Figure 2 to illustrate, assume that the output from a processor element 
within the programmable processor 202C is to be transmitted to a processor element 
within the programmable processor 202E for fiirther processing. One of a nimiber of 
logical connections may be established from the programmable processor 202C to the 
5 programmable processor 202E. One example of a logical connection is from the 

programmable processor 202C to the programmable processor 202B to the programmable 
processor 202A to the programmable processor 202E. A different example of a logical 
connection is from the programmable processor 202C to the programmable processor 
202G to the programmable processor 202F to the programmable processor 202E. In one 

10 embodiment, the host processor 214 determines the selection of the logical comiection 
based on the other active logical connections that may be using the same paths of 
communication. For example, if other logical connections are using the ports between the 
programmable processor 202B to the programmable processor 202A, the host processor 
214 may select the latter example logical coimection to reduce the latency for the data 

15 processing operations. 

In an embodiment, tihe port 404 from which the data originates is initialized. This 
initialization signal will be propagated flm)ugh the entire logical connection, tiiereby 
initializing the data path for tiiis given logical connection. This initialization signal is 
registered and passed through the different ports 404 as if the initialization signal were the 

20 data in order to prevent the propagation delays from accimiulating through long logical 
connections. In an embodiment, this initialization may include fliishing of flie receiver 
and transmitter FIFOs that are used in the logical connection. Therefore, if any data is 
within these FIFOs from a previous logical connection, this initialization causes the data to 
be deleted therefrom. In an embodiment, these different interface signals are handled in 

25 this manner to preclude large combinatorial delays through the logical connections. 
Therefore, routing between the different programmable processors 202 are processed 
through point-to-point coimections that are registered in the different ports 404 that are 
part of the logical connection. 

To illustrate, Figures 7A-7G illustrate a more detailed block diagram of a receiver 

30 port communicating with different transmitter ports, according to one embodiment of the 
invention. In particular, Figures 7A-7D illustrate a more detailed block diagram of the 
receiver port 406 communicating with the transmitter port 408 A that is external to the port 
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ring 250B. Figures 7E-7G illustrate a more detailed block diagram of the receiver port 
406 communicating with ttie transmitter port 408B that is internal to the port ring 250B. 

Figures 7A-7G illustrate the programmable processor 202A and the programmable 
processor 202B. The programmable processor 202A and the progranraiable processor 
5 202B include the port ring 250A and the port ring 250B, respectively. Additionally, the 
programmable processors 202A-202B include a number of receiver and transmitter ports. 
In particular, a given port 404 (shown in Figure 4) includes a transmitter port and a 
receiver port. However, for the sake of clarity, Figures 7A-7G illustrate either a 
transmitter port or receiver port for a given port 404. The port ring 250A of the 
10 programmable processor 202 A includes the transmitter port 408 A. The port ring 250B of 
the programmable processor 202B includes die receiver port 406 and the transmitter port 
408B. 

Figure 7A illustrates that the transmitter port 408A transmits the init_in signal 506 
to the receiver port 406 to flush the FIFOs that are part of the logical connection (between 

1 5 itself and the receiver port 406), Furthermore, Figure 7A illustrates that the receiver port 
406 forwards tiiis initialization through the logical connection, as the init_out signal S 14 to 
the transmitter port 408B through the port ring 2S0B. Accordingly, part of the logical 
connection includes the transmitto: port 408A, the receiver port 406 and the transmitter 
port 408B. This logical connection may include a number of other programmable 

20 processors 202. Therefore, this mitialization may have been received by the transmitter 
port 408A from a different programmable processor 202 through one of the internal 
receiver ports 406 of the port ring 250A. Additionally, the transmitter port 408B may 
forward this initialization may to another programmable processor 202. Once die 
initialization of the logical connection is complete, data may be transmitted through this 

25 logical connection. 

Figure 7B illustrates that the transmitter port 408 A uses the request Jn signal 504 
to request the inputting of data into the receiver port 406. Figure 7C illustrates that, in 
response to the request_in signal 504, and after storage is available in the receiver FIFO 
604 of the receiver port 406, the receiver port 406 uses the grant_in signal 508 to indicate 

30 to the transmitter port 408 A that the transmitter port 408A may transmit data into the 
receiver port 406. Figure 7D illustrates that the transmitter port 408 A uses the data_in 
signal 502 to transmit data for storage into the receiver FIFO 604 of the receiver port 406 
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when the request^in sig^al 504 and the grant_in signal 516 are active on the active edge of 
the clock signal controlling the programmable processing unit 106. 

Additionally as shown in Figure 6, the receiver port 406 transmits and receives 
interface signals from a transmitter port 408B which are both part of a same port ring 250. 
Figures 7E-7G illustrate such communications. 

Figure 7E illustrates that the receiver port 406 uses the request_out signal 5 12 to 
request the inputting of data into the transmitter port 408B (one of the internal transmitter 
ports of the port ring 250B). Figure 7F illustrates that, m response to the request_out 
signal 512, the transmitter port 408B transmits the grant^out signal 516 back to the 
receiver port 406. Figure 7G illustrates that the receiver port 406 uses flie data_out signal 
510 to transmit the data to the transmitter port 408B when the request_out signal 512 and 
the grant_out signal 5 1 6 are active on the active edge of the clock signal controlling the 
programmable processing unit 106* 

Furthermore, although not shown in Figures 7E-7G, the receiver port 406 may 
transmit/receive these interfeces signals (the request^out signal 512, the grant_out signal 
516 and the data_out signal 510) to/from the input processor element 302 (illustrated 
within Figure 3) for the programmable processor 202B. If the data within the receiver 
FIFO 604 is to be inputted to one of the processor elements (the input processor element 
302, the output processor element 304 and/or Ae processor elements 306A-306C) within 
this programmable processor 202 for processing therein, the receiver port 406 transmits 
the request_out signal 512 to the input processor element 302. If the data withm the 
receiver port 406 is to be transmitted to a device external to the progranmiable processor 
202 (e.g., a different programmable processor 202, one of the DMA units 204 or one of 
the external interfaces 208), the receiver port 406 transmits the request^out signal 512 to 
the appropriate transmitter port 408 (the port that is part of the logical connection). 

Figure 8 illustrates a more detailed block diagram of a transmitter port along with 
associated interface signals, according to one embodiment of the invention. In particular, 
Figure 8 illustrates one embodiment of the transmitter port 408 (which includes a 
transmitter FIFO 806) and associated interface signals. The transmitter port 408 is within 
one of the ports 404 (shown in Figure 4) and is to transmit data out from the 
programmable processor 202. 

As shown, a number of the initjn signals 506A-506H, a number of the data_in 

signals 502A-502H and a number of the request Jn signals 504A-504H are mputted into 

15 
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the transmitter port 408 fiom one of the receiver ports 406 that are internal to this 
programmable processor 202 (i.e., fbat are internal to the port ring 2S0 of the 
programmable processor 202). Additionally, the grant_out signal 5 16, the request_out 
signal 5 12, the data_out signal 510 and the init_out signal 514 are outputted from the 
5 transmitter port 408 to receiver ports 406 that are external to the port ring 250 for this 
programmable processor 202. 

The transmitter FIFO 806 is coupled to receive interface signals (the nimiber of the 
init_m signals 506A-506H, the number of the data-in signals 502A-502H and the number 
of the request_in signals 504A-504H) through a multiplexer 804A, a multiplexer 804B and 

10 a multiplexer 804C, respectively, from a number of receiver ports that are internal to the 
port ring 250 of the progranmiable processor 202 or the output processor element 304 (not 
shown in Figure 8). 

To illustrate, Figures 9A-9E illustrate a more detailed block dia^^m of a 
transmitter port commnnicating with different receiver ports that using dififerent mter&ce 

1 5 signals, according to one embodiment of the invention. In particular. Figure 9A illustrates 
a more detailed block diagram of the transmitter port 408 receivuig mterface signals from 
elements that are internal to the port ring 250 of the programmable processor 202 that the 
transmitter port 408 is associated. Figures 9B-9E illustrate a more detailed block diagram 
of the transmitter port 408 receiving interface signals from a receiver port 406 that is 

20 external to the port ring 250 of the programmable processor 202 that the transmitter port 
408 is associated. 

Figures 9A-9E illustrate the programmable processor 202A and the programmable 
processor 202B. The programmable processor 202A and the programmable processor 
202B include the port ring 250A and the port ring 250B, respectively. Additionally, the 

25 progranunable processors 202A-202B include a number of receiver and transmitter ports. 
In particular, a given port 404 (shown in Figure 4) includes a transmitter port and a 
receiver port. However, for the sake of clarity, Figures 9A-9E illustrate either a 
transmitter port or receiver port for a given port 404. The port ring 250A of the 
programmable processor 202A includes the receiver ports 406B-406H and the transmitter 

30 port 408. The port ring 250B of the programmable processor 202B includes the receiver 
port406A. 

With regard to Figure 9A, the output processor element 304 (within the 
programmable processor 202A) is coupled to transmit the init^in signal 506A, the datajn 
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signal 502A and the request Jn signal 504A. The receiver port 406B transmits the mit_ in 
signal 506B, the data_in signal 502B and the request_in signal 504B. The receiver port 
406C transmits the init_ in signal 506C, the data_in signal 502C and the request_in signal 
504C, The receiver port 406D transmits the init_ in signal 506D, the data_in signal 502D 
5 and the request^in signal 504D. The receiver port 406E transmits the init_ in signal 506E, 
the data_m signal 502E and the request Jn signal 504E. The receiver port 406F transmits 
the init_ in signal 506F, the data_in signal 502F and the request Jn signal 504F. The 
receiver port 406G transmits the init_ in signal 506G, the data_m signal 502G and the 
request_in signal 504G. The receiver port 406H transmits the init_ in signal 506H, the 

10 data_in signal 502H and the request_in signal 504H. 

With regard to Figure 8, the transmitter FIFO 806 within the transmitter port 408 . 
uses a select signal 802 to cause the multiplexers 804A-804C to select one of the init_in 
signals 506, one of the data_in signals 502 and one of the request_in signals 504. As 
described above, ttie host processor 108 configures the programmable processing unit 106, 

1 5 wherein output from one processor element in a programmable processor 202 may be 
input to be processed by a different processor element in a different programmable 
processor 202 through a logical coimection. Accordingly, the host processor 108 causes 
the transmitter FIFO 806 to assert the select signal 802 to select the init_in signal 506, the 
data_in signal 502 and the request_in signal 504 from the appropriate source. Returning to 

20 Figure 2 to help illustrate, if a receiver port receives data into the programmable processor 
202B and is to output the data through a transmitter port 408 in the programmable 
processor 202B to a receiver port in ftie programmable processor 202A, the host processor 
108 would configure this transmitter port 408 to select signal 606 from this receiver port. 
Accordingly, the selected receiver port 406 (or the selected output processor 

25 element 304) uses the initjn signal 506 to initialize the logical connection. In an 

embodiment, this initialization may include flushing of the receiver and transmitter FIFOs 
in the ports that are used in the logical connection. Therefore, if any data is within these 
FIFOs (prior to this initialization), this initialization causes the data to be deleted 
therefrom. Additionally, the selected receiver port 406 (or the selected output processor 

30 element 304) uses the request Jn signal 504 to request the input of data into the transmitter 
FIFO 806 for the transmitter port 408. The selected receiver port 406 (or the selected 
output processor element 304) uses data-in signal 502 to transmit data mto the transmitter 
FIFO 806. 
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Additionally as shown in Figure 8, the transmitter port 408 transmits and receives 
intwiace signals from the receiver port 406A of a different progranamable processor 202 
(the prograimnable processor 202B). Figures 9B-9E illustrate such communications. 

Figure 9B illustrates that the transmitter port 408 outputs the init_out signal 5 14 to 
5 the receiver port 406A to which it is attached to generate the logical connection prior to 
the transmission of data (as described above). Figure 9C illustrates that the transmitter 
port 408 outputs the request_out signal 5 12 to request the inputting of data into the 
receiver FIFO of the receiver port 406 A. Figure 9D illustrates that, in response, after 
space is available in the receiver FIFO of the receiver port 406A, the receiver port 406A 
1 0 outputs the grant_out signal 516 that is received by the transmitter port 408. Figure 9E 
illustrates that, in response, the transmitter port 408 outputs data from the transmitter FIFO 
806 to the receiver FIFO of the receiver port 406A using the data_out signal 510. 
Logical Connections among Programmable Processors 

Figure 10 illustrates a number of different routes for a given logical coimection 
1 5 from a somx^e progranunable processor to a destination programmable processor, 

according to one embodiment of the invention. As described above, the host processor 
108 can establish a number of logical connections for the transmission of data from a 
source programmable processor 202 to a destination programmable processor 202. In 
particular, the ou^ut of one processing operation by an element in a first programmable 
20 processor 202 may be used as input for a different processing operation by an element in a 
second programmable processor 202. 

For example, assume that the programmable processing unit 106 is in an image 
processing mode (which is described in more detail below). The first programmable 
processor 202 may perform an operation to reduce gjiosting in the video, while the second 
25 programmable processor 202 receives the output from the first programmable processor 
202 and may perform an operation to reduce noise in the video. A third programmable 
processor 202 may receive the output from the second programmable processor 202 and 
may perform an operation to reduce the dot-crawl in the video. In an embodiment, 
different programmable processors 202 perform different data operations because (as 
30 described in more detail below) one programmable processor 202 may have dedicated 
hardware accelerators for performing a given operation. 

Moreover, while this example illustrates the output of an operation in one 
programmable processor 202 being transmitted directly to a different programmable 
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processor 202, embodiments of the invention are not so limited. In an embodiment, one 
programmable processor 202 may transmit the ou^t of an operation to one of the 
memories 104. Accordingly, a second programmable processor 202 may retrieve the 
stored data from the memory 104. Such operations may be used when the second 
5 programmable processor 202 may require a certain amoxmt of the output from the first 
operation prior to its operations. For example, the second programmable processor 202 
may perform a temporal noise reduction operation across a nxmiber of frames of the video. 
Accordingly, the output from the first programmable processor 202 is stored in the 
memory 108 until a predetermined number of frames have been processed. Continuing 

1 0 with this example, the first programmable processor 202 may continue to perform the 
ghost reduction operation of the incoming video, while, simultaneously, the second 
programmable processor 202 may perform the temporal noise reduction operation of the 
outputted video (as described above). Similarly, the programmable processing unit 106 
may perform multiple operations across different programmable processors 202 during the 

1 5 storage mode (which is further described below). 

Because the architecture of the processors has a point-to-point configuration (as 
illustrated in Figure 2), the first programmable processor 202 may not be directly 
connected to the second progranunable processor 202. Therefore, a logical connection 
from the first programmable processor 202 (the source programmable processor 202) to 

20 the second programmable processor 202 (the destination programmable processor 202) 
through one to a number of intermediate programmable processors 202 is established. 

Figure 10 illustrates the programmable processing unit 106 of Figure 2, along with 
five different routes for a given logical connection from the programmable processor 
202A to the programmable processor 202H. 

25 A first route 1002 for a logical connection starts at the programmable processor 

202 A (the source programmable processor) and goes through the port ring 250D of the 
programmable processor 202D (a first intermediate programmable processor) and 
completes at the port ring 250H of the progranunable processor 202H (the destination 
programmable processor). In particular, the data is transmitted from a transmitter port of 

30 the port ring 250A of the programmable processor 202A to a receiver port of the port ring 
250E of the programmable processor 202D. The receiver port of the port ring 250D of the 
programmable processor 202D transmits the data to a transmitter port of the port ring 
250D of the programmable processor 202D (through the port ring 250D of flie 
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programmable processor 202D). This transmitter port of the port ring 2S0D of the 
programmable processor 202D transmits the data to a receiver port of the port ring 2S0H 
of the programmable processor 202H. 

A second route 1004 for a logical connection starts at the programmable processor 
5 202A (the source programmable processor) and goes througji the programmable processor 
202E (a first intermediate programmable processor) and completes at the programmable 
processor 202H (the destination programmable processor). In particular, the data is 
transmitted from a transmitter port of the port ring 250A of the programmable processor 
202A to a receiver port of the port ring 250E of the programmable processor 202E. The 

10 receiver port of the port ring 250E of the programmable processor 202E transmits the data 
to a transmitter port of the port ring 2S0E of the programmable processor 202E (through 
tihe port ring 250E of the programmable processor 202E). This transmitter port of the port 
ring 250E of the programmable processor 202E transmits the data to a receiver port of the 
port ring 2S0H of the programmable processor 202H. 

15 A third route 1006 for the logical connection starts at the programmable processor 

202A (the source programmable processor) and goes through the programmable processor 
202E (a first intermediate programmable processor) through the programmable processor 
202F (a second intermediate programmable processor) through the programmable 
processor 202G (a third intermediate programmable processor) and completes at the 

20 programmable processor 202H (the destination programmable processor). In particular, 
the data is transmitted from a transmitter port of the port ring 250A of the programmable 
processor 202 A to a receiver port of the port ring 250E of the programmable processor 
202E. The receiver port of the port ring 250E of the programmable processor 202E 
transmits the data to a transmitter port of the port ring 250E of the programmable 

25 processor 202E (through the port ring 250E of the programmable processor 202E). This 
transmitter port of the port ring 250E of the programmable processor 202E transmits the 
data to a receiver port of the port ring 250F of the programmable processor 202F. The 
receiver port of the port ring 250F of the programmable processor 202F transmits the data 
to a transmitter port of the port ring 250F of the progranmiable processor 202F (through 

30 the port ring 250F of the programmable processor 202F). This transmitter port of the port 
ring 250F of the programmable processor 202F transmits the data to a receiver port of the 
port ring 250G of the programmable processor 202G. The receiver port of the port ring 
250G of the programmable processor 202G transmits the data to a transmitter port of flie 



wo 2005/067290 



PCT/US2004/043080 



port ring 250G of the programmable processor 202G (through the port ring 250G of the 
programmable processor 202G), This transmitter port of the port ring 250G of the 
programmable processor 202G transmits the data to a receiver port of the port ring 250H 
of the programmable processor 202H. 
5 A fourth route 1008 for the logical connection starts at the programmable 

processor 202A (the source programmable processor) and goes through the programmable 
processor 202B (a first intermediate programmable processor) through the programmable 
processor 202C (a second intermediate programmable processor) through the 
programmable processor 202D (a third intermediate programmable processor) and 

10 completes at the programmable processor 202H (the destination programmable processor). 
In particular, the data is transmitted from a transmitter port of the port ring 2S0A of the 
programmable processor 202A to a receiver port of the port ring 2S0B of the 
programmable processor 202B. Ite receiver port of the port ring 250B of the 
programmable processor 202B transmits die data to a transmitter port of the port ring 

IS 2S0B of tiie programmable processor 202B (through the port ring 250B of the 
programmable processor 202B). This traxismitter port of the port ring 2S0B of flie 
programmable processor 202B transmits the data to a receiver port of the port ring 2S0C 
of die programmable processor 202C. The receiver port of the port ring 2S0C of the 
programmable processor 202C transmits the data to a transmitter port of the port ring 

20 250C of the programmable processor 202C (through the port ring 250C of the 

programmable processor 202C). This transmitter port of the port ring 250C of the 
programmable processor 202C transmits the data to a receiver port of the port ring 250D 
of the programmable processor 202D. The receiver port of the port ring 250D of the 
programmable processor 202D transmits the data to a transmitter port of the port ring 

25 250D of the progranmiable processor 202D (through the port ring 250D of the 

programmable processor 202D). This transmitter port of the port ring 250D of the 
programmable processor 202D transmits the data to a receiver port of the port ring 250H 
of the programmable processor 202H. 

A fifth route 1010 for the logical connection starts at the programmable processor 

30 202A (the source programmable processor) and goes through the programmable processor 
202B (a first intermediate programmable processor) through the programmable processor 
202F (a second intermediate programmable processor) through the programmable 
processor 202G (a third intermediate programmable processor) and completes at the 
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programmable processor 202H (the destination programmable processor). Accordingly, as 
shown, one to a number of different routes can be used to establish a logical connection 
between two different programmable processors 202. In particular, the data is transmitted 
from a transmitter port of the port ring 250A of the programmable processor 202A to a 

5 receiver port of the port ring 250B of the programmable processor 202B. The receiver 
port of the port ring 250B of the programmable processor 202B transmits the data to a 
transmitter port of the port ring 250B of the programmable processor 202B (through the 
port ring 250B of the programmable processor 202B). This transmitter port of the port 
ring 250B of the programmable processor 202B transmits the data to a receiver port of the 

10 port ring 250F of the programmable processor 202F. The receiver port of the port ring 
2S0F of the programmable processor 202F transmits the data to a transmitter port of the 
port ring 250F of the programmable processor 202F (through the port ring 250F of the 
progranraiable processor 202F). This teansmitter port of the port ring 250F of the 
programmable processor 202F transmits the data to a receiver port of tiie port ring 250G of 

1 5 the programmable processor 202G. The receiver port of the port ring 250G of flie 
programmable processor 202G transmits the data to a transmitter port of the port ring 
250G of the programmable processor 202G (through the port ring 250G of the 
programmable processor 202G). This transmitter port of the port ring 250G of the 
programmable processor 202G transmits the data to a receiver port of the port ring 250H 

20 of flie programmable processor 202H. 

As described, the traversal through an intermediate programmable processor 202 is 
through the ports 404 of the port ring 250 and not through processor elements or other 
components intemal to the programmable processor 202. Therefore, the processor 
elements within an intermediate programmable processor 202 do not perform any type of 

25 operation on data that is transmitted from the source programmable processor 202 and the 
destination programmable processor 202. 

Therefore, this architecture uses a combination of hardwired point-to-point 
connections which are configurable. A transmitter port is connected to a predefined 
destination, which allows for direct wiring of the die of the programmable processing unit 

30 106. However, a given transmitter port can select one of several sources for the 
transmitted data. In turn, a receiver port makes its data available to a mmiber of 
transmitter ports. This architecture allows for eflBcient routing of data and control within 
the port ring 250 for a progranmiable processor 202. Moreover, passing the initialize 
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signal through a logical connection allows for single-point clearing of the logical path that 
the data is to traverse at the source of &e data and ensuie that the intermediate connections 
do not need to be cleaned up or emptied before or after data transfers. Moreover, logical 
connections that transfer an indetenninate amount of data and get backed up or stalled can 
5 be cleared out with a single command beginning at the source and traversing the logical 
connection. 



Operations of the Programmable Processing Unit 
The operations of the programmable processing unit 106, according to one 
10 embodimrat, are now described. . . . 



Data Transmission Operations among Programmable Processors 

Figure 11 illustrates a flow diagram for establishing and initializing of a logical 
connection within a progranunable processing unit, according to one embodiment of the 
15 invention. 

In block 1 102, configuration data for a logical connection to be established for 
transmission of data is received. With reference to Figure 2, the different programmable 
processors 202 (the source programmable processor, the intermediate progranmiable 
processor(s) and the destination programmable processor) receive the configuration data 

20 for a logical connection to be established for transmission of data. In an embodiment, the 
host processor 108 transmits this configuration data to these programmable processors 202 
through the internal global bus 212. In an embodiment, the host processor 108 may also 
download microcode into the programmable processors 202 that are part of the logical 
connection. For example, the host processor 108 may download a specific application into 

25 the source and/or destination programmable processor 202. Control continues at block 
1104. 

In block 1 104, the logical connection is established. With reference to Figures 6 
and 8, the receiver ports 406 and the transmitter ports 408 (through which data is 
transmitted as part of the logical connection) establish the logical coimection based on the 
30 configuration data received. As described above, the receiver ports 406 use the select 
signals 606 to determine which grant_out signal 516 will be selected by multiplexer 602. 
For example, if the data received into the receiver port 406A is to be outputted to the 
transmitter port 408D, then the configuration data causes the receiver port 406A is use the 
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select signal 606 to select the grant_out signal S16 associated widi the transmitter port 
408D. Similarly, the transmitter ports 408 uses the select signals 802 to determine which 
of the request_in signal 504, the data_in signal 502 and the init_in signal 506 will be 
selected by the multiplexer 804C, the multiplexer 804B and the multiplexer 804A, 
5 respectively. Control continues at block 1 106. 

In block 1 106, the logical connection is initialized. With reference to Figures 2, 6 
and 8, the transmitter port 408 for the source programmable processor 202 that is to 
originate this logical connection transmits the init_out signal 514 to tiie receiver port 406 
of the next source programmable processor 202 involved with this logical connection. 

10 This receiver port 406 receives this signal as init^in signal 506 and ou^uts the init_out 
signal 5 14 to the transmitter port within this source programmable processor 202. This 
transmission of init_out signals 514 and receipt of init_in signals 506 continues along the 
logical connection until the transmitter port 408 of the destination programmable 
processor 202 is reached* Accordingly, this initialize signal initializes the different ports 

15 involved in the logical connection. In an embodiment, this initialization may include 
flushing of the receiver and transmitter FIFOs that are used in the logical connection. 
Therefore, if any data is within these FIFOs from a previous logical coimection, tiiis 
initialization causes the data to be deleted there£ix>m. 

In an embodiment, a series of process operations are performed/executed by 

20 different components in different progranmiable processors 202 within the progranmiable 
processing unit 106. The output of a first process operation is used as input to a second 
process operation, etc. As described above, logical connections are established for the 
transmission of the data to the different programmable processors 202. Therefore, a 
logical connection is established for each transmission from one element in the 

25 programmable processing unit 106 to a different element in the programmable processing 
unit 106. One embodiment for the processing of data in the programmable processing unit 
106 will now be described. Figure 12 illustrates a flow diagram for processing of data by 
a programmable processing imit, according to one embodiment of the invention. In 
particular, the flow diagram 1200 describes the processing of data by one of the 

30 programmable processors 202 within the progranunable processing unit 106, according to 
one embodiment of the invention. 

In block 1202, a stream of data is received. With reference to Figure 2, a first of 
the programmable processors 202 receives the stream of data from one of a number of 
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sources. For example, the programmable processor 202A may receive the stream of data 
fiom an external source (such as the video signal processor 104). The programmable 
processor 202 A may also receive the stream of data from the memory 108 through the 
memory interface 206 and the DMA unit 204. Control continues at block 1204. 

5 In block 1204, the stream of data is processed in a first programmable processor. 

With reference to Figure 2, a component (e.g., one of the processor elements 302, 304, 
306A-306C or one of the accelerator units 3 lOA-3 lOB) within the first programmable 
processor 202 performs a first process operation. The input processor element 302 
receives the data through the receiver port 406. In an embodiment, any of the processor 

10 elements 302, 304, 306A-306C performs/executes the process operation on the received 
data. In one embodiment, as part of the configuration of the logical connection of which 
the process operation is associated, the host processor 214 may indicate which of the 
components in the first programmable processor 202 is to perform/execute the process 
operation. Accordingly, the input processor element 302 may store the data into the 

1 S memory 314 wherein the designated components retrieves the data and p^orms/executes 
the first process operation on such data. The first programmable processor 202 may 
output a result for processing a part of the stream of data, while continuing to process a 
different part of the stream of data. For example, the first programmable processor 202 
may output a result for processing the first frame of video, while continuing to process 

20 subsequent firames of the video. Control continues at block 1206. 

In block 1206, the output of the process operation is transmitted/forwarded to a 
different programmable processor or a memory through a logical connection. With 
reference to Figures 2 and 3, the output processor element 304 (in flie programmable 
processor 202 in which the first process operation is performed/executed) 

25 transmits/forwards the output of the process operation through a transmitter port 408 that 
is part of the configured logical connection to a different programmable processor 202 or 
to the memory 108 through the configured logical connection. Control continues at block 
1208. 

In block 1208, the result is processed in the different programmable processor. 
30 Similar to the processing in the first programmable processor (described in block 1204), a 
component (e.g., one of the processor elements 302, 304, 306A-306C or one of the 
accelerator units 310A-310B) within the different programmable processor 202 performs a 
different image process operation. Control continues at block 1210. 
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In block 1210, a determination is made of whether the process operations for the 
stream of data are completed. In particular, the current programmable processor 202 that 
is processing a part of the stream of data determines whetiier die output of its operations is 
to be transmitted to a different programmable processor 202 or to the memory 108 through 
5 a logical connection based on configuration data received from the host processor 214. In 
particular, the host processor 214 may configure the programmable processing unit 106 to 
receive a stream of data and to perform five different compression/storage process 
operations in five different programmable processors 202. Accordingly, the host 
processor 214 configures the different logical connections to transmit the data to the five 

.0 different programmable processors 202 in a given ord^. Upon determining that the 

process operations are not complete for the stream of data, control continues at block 1006 
wherein the result of the processing is outputted to a different programmable processor 
202 or the memory 108. The operations of block 1206 and 1206 continue until the process 
operations are complete for the stream of data. 

IS In block 1212, upon determining that the process operations are complete for the 

stream of data, the results are outputted. With reference to Figure 2, in one embodiment, 
if the current mode is the storage mode, the final programmable processor 202 in the chain 
of programmable processors to process the stream of data outputs the result to the memory 
108. With reference to Figure 1, in an embodiment, if the current mode is the image 

20 processing mode, the final programmable processor 202 ou^uts the result to tiie video 
display processor 110 through the I/O interface 208. 

One embodiment of the operations for the transmission of data between different 
ports of the programmable processors 202 based on a handshake protocol will now be 
described. In particular, Figures 13 A-13B illustrate flow diagrams for communications 

25 among memories of different ports in a programmable processor, according to 

embodiments of the invention. By way of example and not by way of limitation, the 
operations of the flow diagram 1300 and 1330 are described such that the FIFO memories 
within these different ports have a depth of two (i.e., a two-entry FIFO). Figure 13A 
illustrates a flow diagram for receiving data into a memory of a port, while Figure 13B 

30 illustrates a flow diagram for transmitting data out of a memory of a port. 

In block 1302, a request to receive data is received into a receiver port of a port 
ring of a programmable processor. With reference to Figure 6, the receiver port 406 
receives a request to receive data through the request_in signal 504. As described above, a 
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transmitter port 408 that is coupled to the receiver port 406 transmits this request. Control 
continues at block 1304. 

In block 1304, a determination is made of whether the receiver FIFO of the 
receiver port is full. With reference to Figure 6, the receiver port 406 determines whether 
5 the receiver FIFO 406 is full Upon determining that the receiver FIFO 604 of the receiver 
port 406 is full, control continues at block 1304 where this determination is again made. 
In an embodiment, this request may time out after a predetermmed period, wherein an 
alarm is issued to the host processor 108 and the operation of the flow diagram 1300 are 
aborted. 

lO In block 1306, upon determining that the receiver FIFO 604 of the receiver port 

406 is not full, a determination is made of whether the receiver FIFO is one-half full. As 
described above, the receiver FIFO 604 is described as having a depth of two. Wifli 
reference to Figure 6, the receiver port 406 determines whether the receiver FIFO 604 is 
one-half fiill In other words, the receiver port 406 determines whether the receiver FIFO 

15 604 is empty or has data in one entry. Upon determining fliat the receiver FIFO is not one- 
half full (i.e,, the receiver FIFO is empty), control continues at block 13 10, v4uch is 
described in more detail below. 

In block 1308, upon determining that the receiver FIFO is one-half full, data stored 
in the first entry m the receiver FIFO is moved to the second entry in the receiver FIFO. 

20 With reference to Figure 6, the receiver port 406 moves the data stored in the first entry to 
the second entry in the receiver FIFO 604. Control continues at block 1310. 

In block 13 10, a grant is sent to the requesting hransmitler port (the transmitter port 
requesting to send data to the receiver port). With reference to Figure 6, the receiver port 
406 transmits a grant through the grant Jn signal 508 to the transmitter port 408, thereby 

25 indicating that the transmitter port 408 may transmit data into the receiver FIFO 604. 
Control continues at block 1312. 

In block 1312, received data is stored into the receiver FIFO of the receiver port. 
With reference to Figure 6, the receiver port 406 stores the received data into the furst 
entry of the receiver FIFO 604, which is received from the transmitter port 408 through 

30 the data_in signal 502. 

An embodiment of transmitting data out of a memory of a port is now described in 
conjunction with the flow diagram 1330 of Figure 13B. In block 1332, a request to output 
data to a receiver port is transmitted. With reference to Figure 8, the transmitter port 408 
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transmits the request to output data to the receiver port 406 (to which the transmitter port 
408 is coupled) through the request^out signal 512. Control continues at block 1334. 

In block 1334, a determination is made of whether a grant has been received from 
the receiver port. With reference to Figure 8, the transmitter port 408 determines whether 
5 a grant has been received from the receiver port 406 based on the value of the grant_out 
signal 516. Upon determining that the grant has not been received from the receiver port 
406, control continues at block 1334, wherein the transmitter port 408 again makes this 
determination. In an embodiment, this checking of a grant may time out after a 
predetermined period, wherein an alarm is issued to the host processor 108 and the 
1 0 operation of the flow diagram 1 300 are aborted. 

In block 1336, upon determining that the grant has been received from die receiver 
port, a determination is made of whether the transmitter FIFO is one-half fiill. With 
reference to Figure 8, the transmitter port 408 determines whether the transmitter FIFO 
706 is one-half fiill. Because the operations of the flow diagram 1330 have been initiated, 
15 the assumption is that the transmitter FIFO 706 is not empty. 

In block 1338, \xpon determinmg that the transmitter FIFO is not one-half full (the 
transmitter FIFO is full), data from the second entry of the transmitter FIFO is outputted to 
the receiver FIFO. With reference to Figure 8, the transmitter port 408 outputs the data 
from the second entry of the transmitter FIFO 706 through the data_out signal 510 to the 
20 receiver FIFO, thereby completing the operations of the flow diagram 1330. 

In block 1340, upon determining that the transmitter FIFO is one-half fidl, data 
from the fu^t entry of the transmitter FIFO is outputted to the receiver FIFO. With 
reference to Figure 8, the transmitter port 408 ou^uts the data from the first entry of the 
transmitter FIFO 706 through the data_out signal 510 to the receiver FIFO, thereby 
25 completing the operations of the flow diagram 1330. 

While the flow diagrams 1300 and 1330 describe the communications between 
receiver and transmitter ports that are part of different port rings, the handshake protocol 
operations described are also applicable to communications between receiver and 
transmitter ports that are part of the same port ring. Moreover, such handshake protocol 
30 operations are applicable for the inputting and outputting of data into the input processor 
element 302 and the output processor element 304, respectively. 

Accordingly, as described in Figure 13A-13B, in an embodiment, the data driven 
architecture for image process operations is based on this handshake protocol for 



wo 2005/067290 



PCT/US2004/043080 



transmitting data through the different port for logical connections. A bubble is a clock 
period where no data transaction occurred (i.e., data was not moved in the given clock 
period). For example, data was not ready to be transmitted at flie beginning and/or data 
was not retrieved at the end of the logical connection. Therefore, there may be an empty 

5 place in the logical connection because data was not put into the logical connection. 

Moreover, as described, if a bubble forms in the logical connection because of a data stall 
condition at the source programmable processor or the destination programmable 
processor, then data is paused for a single clock period. In other words, the FIFO 
memories within the receiver and transmitter ports allow for bubbles in the logical 

1 0 connection that do not grow jfrom stopping and restarting of the data flow within the 

logical connection. Embodiments of the invention are such fliat a bubble does not force a 
delay at either end of the logical connection beyond the bubble. The bubble does not 
require die programmable processing unit 106 to resync (which may require more clock 
periods to recov^ than the number of clocks periods associated with the bubble itself). 

15 

Multiple Mode Operations of the Programmable Processing Unit 

The multiple mode operations of the programmable processing unit 106 are now 
described. Figure 14 illxistrates a flow diagram for multiple mode operations of a 
programmable processing unit, according to one embodiment of the invention. 

20 In block 1402 of the flow diagram 1400, video is received. With reference to the 

embodiment of Figure 1, the programmable processing unit 106 receives video from the 
video signal processor 104 through the I/O interface 208. In an embodiment, the received 
video is a stream of digital data. 

In block 1404, a determination is made of whether the system is in a storage mode. 

25 With reference to the embodiment of Figure 2, the host processor 214 makes this 

determination. In an embodiment, the system 100 is in one of two modes: 1) a storage 
mode or 2) an image processing mode. The host processor 214 may make this 
determination based on input from a user of the system 100. In an embodiment, the 
system 100 is within a television (or part of a set-top box or media center coupled thereto) 

30 wherein a user selects the current mode using a remote control. For example, the user may 
cause the system 100 to be in the storage mode when pausing tiie display of the video to 
the video display 1 12. The user may also cause the system 100 to be m the storage mode 
when rewinding the video being displayed on the video display 1 12. Accordingly, the 
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host processor 214 receives this input from the user. In one embodiment, I/O logic (not 
shown) within the system 100 may receive and process input for the system 100. Such I/O 
logic can forward this input to the host processor 214. 

In an embodiment, the current mode of the system 100 remains in the storage mode 

5 until the memory 108 does not have stored video therein. For example, if the user 
discontinues a pause operation the display on the video display 1 12 (thereby resuming 
display), the system 100 continues in the storage mode until the memory 108 does not 
include stored video. Therefore, if the memory 108 includes stored video, the system 100 
may remain in the storage mode until the user performs a fast-forward operation (for 

0 example to skip a commercial). Such an operation causes the rate of output of the video 
from the memory 108 to exceed the rate of input of the incoming video therein. 

Upon determining that the current mode of the system 100 is not the storage mode 
(but rather m the image processing mode), control continues at block 1412, which is 
described in more detail below. Upon determining that the current mode of the system 

IS 100 is in ttie storage mode, control continues at blocks 1406 and 1410, which are now 
described. 

In block 1406, the video is compressed. With reference to the embodiment of 
Figures 2 and 3, one or more processor elements in one or more programmable processors 
202 compress the video. In one embodiment, as part of the compression of the video, a 

20 number of different noise reduction operations may be performed. In an embodiment, the 
compression of the video may include image scaling, frame rate reduction, etc. Returning 
to Figure 2 to help illustrate, a processor element in the programmable processor 202A 
may perform a noise reduction operation. The programmable processor 202E can receive 
the output from this noise reduction operation. A processor element in the programmable 

25 processor 202E can reduce the image in the video through an image scale operation. The 
programmable processor 202H may receive the output from this image scale operation. A 
processor element in the programmable processor 202H may fiirther compress the video 
through a fi^e rate reduction operation. 

Accordingly, as described in this example, in an embodiment, the host processor 

30 214 downloads the appropriate microcode mto the appropriate programmable processors 
202. In particular, the host processor 214 may download microcode into tiie 
programmable processors 202 when tiie current mode of the system 100 changes. For 
example, assume that the processor elements in the programmable processor 202 have 
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been configured to perfonn an enhanced video processing operation (as fur&er described 
below) as part of the image processing mode. Therefore, after the cunent mode of the 
system 100 is changed to the storage mode, the host processor 214 may download 
microcode into the programmable processor 202H for the firame rate reduction operation. 

5 Additionally, the host processor 214 may configure the logical connections among 

the programmable processors 202 for processing (as described above in conjunction with 
the description of Figure 10). In an embodiment, at least one of the programmable 
processors 202 may be configured to store its result into the memory 108, which is 
subsequently retrieved by a different programmable processor 202. For example, if one of 

.0 the compression operations is performed across a number of firames of the video, a givm 
number of firames of video may be stored in the memory 108 prior to processing by the 
different progranmiable processor 202. 

Returning to tiie example above regarding the different processor elements in fiie 
different programmable processors performing the different compression operations, flie 

IS host processor 214 establishes a logical connection fi:om the I/O mterface 208 to the 
programmable processor 202A. The host processor 214 also establishes a logical 
connection firom tiie programmable processor 202A to the programmable processor 202E. 
The host processor 214 also establishes a logical connection firom the programmable 
processor 202E to the programmable processor 202H. Control continues at block 1408. 

20 In block 1408, the compressed video is stored. With reference to the embodunent 

of Figure 2, one of the programmable processors 202 stores the compressed video. As 
described above, in an embodiment, a number of compression operations are performed on 
the video by at least one of the programmable processors 202. The programmable 
processor 202 that completes the compression operations may store the compressed video 

25 into the memory 108. Returning to the example described above in relation to the block 
1406, after completion of the frame rate reduction operation, the programmable processor 
202 stores this result into the memory 108 through a logical connection (between the given 
programmable processor 202 and the memory 108 as described above). 

In block 1410, the video is displayed (independent of enhanced video processing) 

30 based on control input. As described above, in an embodiment, the system 100 is part of a 
television or other viewing device. A user of such device controls the displaying of the 
video on the video display 1 12. For example, the user may pause, replay, fast forward or 
play the video on the video display 1 12 using a remote control. Accordingly, the control 
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input is received from tiie user and transmitted to the host processor 2 14. If the control 
input is to play the video, tiie host processor 214 establishes a number of logical 
connections to uncompress the video that is stored in the memory 108 and to output such 
video to the video display processor 1 10 for display on the video display 1 12. 
5 To illustrate, assume that an image scaling operation and a frame rate reduction 

operation are performed to compress the video stored in the memory 108. Therefore, the 
host processor 214 establishes logical connections among the programmable processors 
202 that are to perform the decompression operations. For example, a logical connection 
may be established between the memory 108 and the programmable processor 202F. A 

10 processor element in the programmable processor 202F may retrieve the compressed video 
from ttie memory 108 and insert frames back into the video (using, for example, 
interpolation). A different logical connection may be established between the 
programmable processor 202F and the programmable processor 202D. The video from 
the results from the progranmiable processor 202F are transmitted to the programmable 

IS processor 202D through the established logical connection. Hie programmable processor 
202D may perform an image scale operation to increase the image size of the video. 
Another logical connection may be established between the programmable processor 
202D and the I/O interface 208. The video from the results from the programmable 
processor 202D are transmitted to the I/O interface 208 through the established logical 

20 connection. The video is outputted therefrom to the video display processor 110. 

While described such tiiat there is one-to-one relationship between the number of 
compression operations and the decompression operations, embodiments are not so 
Ihnited. Returning to the example above to illustrate, in one embodiment, the insertion of 
frames back into the video is not performed in order to increase the execution of the 

25 decompression and display. 

With regard to the display of video based on the control input (at block 1410), the 
display of video on the video display 1 12 may be paused. Accordingly, the 
decompression operations described above are not performed. Additionally, the 
programmable processing xmit 106 continues to compress and store the incoming video 

30 into the memory 108 (as described above m blocks 1406 and 1408). 

Based on the control input, the display of the video on the video display 1 12 may 

be teplayed. The host processor 214 receives the control input that mdicates that the video 

is to be replayed. Accordingly, the host processor 214 configures the programmable 

32 
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processors 108 to decompress and display previous fiames in reverse order until the 
control input is changed or the memory 108 does not include any stored video. 

Based on the control input, the display of the video on the video display 1 12 may 
be fast forward until the incoming video is what is being displayed. For example, a user of 
5 the system 100 may desire to skip a commercial Therefore, the host processor 214 
configures the programmable processors 108 to decompress and display the video at a 
faster rate in comparison to when the video is being played. 

Returning to the flow diagram 1400, in block 1412, upon determining that the 
system 100 is not in the storage mode (but rather the image processing mode), enhanced 

10 video processing is performed. With reference to the embodunent of Figure 2, one or 
more processor elements in one or more programmable processors 202 perform the 
enhanced video processing. In one embodiment, the enhanced video processing includes 
ghost reduction operations, noise reduction operations (spatial, temporal, etc.), dot-crawl 
reduction operations, etc. 

IS Returning to Figure 2 to help illustrate, in one embodiment, the host processor 214 

establishes logical connections among the I/O inter&ce 208 and different programmable 
processors 202 involved in the enhanced video processing (as described above). 
Moreover, the host processor 214 may download nucrocode to be executed as part of the 
enhanced video processing operation. For example, assume that the programmable 

20 processor 202B is configured to perform a ghost reduction operation; the progranamable 
processor 202H is configured to p^orm a spatial noise reduction operation; the 
programmable processor 202C is configured to perform a temporal noise reduction 
operation; and the programmable processor 202E is configured to perform a dot-crawl 
reduction operation. 

25 Accordingly, the host processor 214 may download microcode into the 

programmable processor 202B, the programmable processor 202H, the programmable 
processor 202C and/or the programmable processor 202E. The host processor 214 also 
establishes a logical connection from the I/O interface 208 (where the data is received) to 
the programmable processor 202B (where the ghost reduction operation is performed), 

30 The host processor 214 also establishes a logical connection from the programmable 

processor 202B and the programmable processor 202H (where the spatial noise reduction 
operation is performed). Additionally, assuming that the temporal noise reduction 
operation is performed over a number of frames of the video data, the host processor 214 
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establishes a logical connection from the programmable processor 202H to the memory 
108 (for storage of a number of frames of the video data prior to the temporal noise 
reduction operation). The host processor 214 also establishes a logical connection from 
the memory 108 to the programmable processor 202C. Therefore, after the given number 

5 of frames of the video data has been stored in the memory 108 from the output of the 
programmable processor 202H, the frames are inputted to the programmable processor 
202C (where the temporal noise reduction operation is performed). The host processor 
214 also establishes a logical connection from the programmable processor 202C and the 
programmable processor 202E (where the dot-crawl reduction operation is performed). 

10 The host processor also establishes a logical connection from the programmable processor 
202E to the I/O interface 208, where the video data is output for display. 

In block 1414, the video with enhanced video processing is displayed. With 
reference to tiie embodiment of Figures 1 and 2, the I/O mterface 208 outputs the video 
data for display to the video display processor 1 10. 

IS Additionally, in an embodiment, after the current mode of the system 100 is 

changed, the current operations are interrupted and control continues at point one, where a 
determination regarding the mode of the system at block 1404. For example, if the current 
mode is flie image processing mode and flie user selects the pause button to pause the 
display on the video display 1 12, the current mode is changed to Ihe storage mode and 

20 control is interrupted and continued at point one. 

While the flow diagram 1400 illustrates the compression of the video prior to 
storage, embodiments of the invention are not so limited. In another embodiment, the 
video is received and stored into the memory 108 without compression operations. 
In the description, numerous specific details such as logic implementations, 

25 opcodes, means to specify operands, resource partitioning/sharing/duplication 
implementations, types and interrelationships of system components, and logic 
partitioning/integration choices are set forth in order to provide a more thorough 
understanding of the present invention. It will be appreciated, however, by one skilled in 
the art that embodiments of the invention may be practiced without such specific details. 

30 In other instances, control structures, gate level circuits and ftiU software instruction 
sequences have not been shown in detail in order not to obscure the embodiments of the 
invention. Those of ordinary skill in the art, with the included descriptions will be able to 
implement appropriate frmctionality without undue experimentation. 
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References in the specification to "one embodiment**, "an embodinaenf *, "an 
example embodiment**, etc., indicate that the embodiment described may include a 
particular feature, structure, or characteristic, but every embodiment may not necessarily 
include the particular feature, structure, or characteristic. Moreover, such phrases are not 
5 necessarily referring to the same embodiment* Further, when a particular feature, 

structure, or characteristic is described in connection with an embodiment, it is submitted 
that it is within the knowledge of one skilled in the art to affect such feature, structure, or 
characteristic in connection with other embodiments whether or not explicitly described. 
Embodiments of the invention include features, methods or processes that may be 

. 0 embodied within machine-executable instructions provided by a machine-readable 
mediimi. A machine-readable medium includes any mechanism which provides (i.e.» 
stores and/or transmits) information in a form accessible by a machine (e.g., a computer, a 
network device, a personal digital assistant, manu&cturing tool, any device with a set of 
one or more processors, etc.)- In an exemplary embodiment, a machine-readable medium 

15 mcludes volatile and/or non-volatile media (e.g., read only memory (ROM), random 

access memory (RAM), magnetic disk storage media, optical storage media, flash memory 
devices, etc.), as well as electrical, optical, acoustical or other form of propagated signals 
(e.g., carrier waves, infrared signals, digital signals, etc.)). 

Such instructions are utilized to cause a general or special purpose processor, 

20 progranmied wiA the instructions, to perform methods or processes of the embodiments of 
the invention. Alternatively, the features or operations of embodiments of the invention 
are performed by specific hardware components which contain hard- wired logic for 
performing the operations, or by any combination of programmed data processing 
components and specific hardware components. Embodiments of the invention include 

25 software, data processing hardware, data processing system-implemented methods, and 
various processing operations, fiuther described herein. 

A number of figures show block diagrams of systems and apparatus for a 
programmable video processing/storage architecture, in accordance with embodiments of 
the invention. A number of figures show flow diagrams illustrating operations for a 

30 progranmiable video processing/storage architecture. The operations of the flow diagrams 
will be described with references to the systems/apparatus shown in the block diagrams. 
However, it should be understood that the operations of the flow diagram could be 
performed by embodiments of systems and apparatus other than those discussed with 
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reference to the block diagrams, and embodiments discussed with reference to the 
systems/apparatus could perform operations different Aan those discussed with reference 
to the flow diagram. 

In view of the wide variety of permutations to the embodiments described herein, 
5 this detailed description is intended to be illustrative only, and should not be taken as 
limiting the scope of the invention. To illustrate, while operations have been described 
with reference to the programmable processing unit 106, embodiments of the invention are 
not so limited. For example, in an embodiment, the multiple mode operations may be 
performed by software applications executing in a single general purpose processor. 

10 Moreover, while described with reference to the processing of video, embodiments of the 
invention are not so limited. For example, in one embodiment, the multiple mode 
operations may be applicable to processmg of other types of data such as audio, a 
combmation of audio and video, etc. To further illiistrate possible permutations to 
embodiments described herein, while the multiple mode operations are described in 

IS reference to two modes, embodiments of the invention are not so limited as there can be a 
greater mmiber of such modes. For example, in an embodiment, a third mode of tiie 
programmable processing unit 106 may involve other aspects of video processing. 
Moreover, while embodiments have been described with regard to an analog tuner wherein 
an RF signal is received and an analog signal is genemted, embodiments of tiie invention 

20 are not so limited. For example, m one embodiment, the video tuner 102 may be a digital 
tuner to receive the RF signal and to generate a digital signal. What is claimed as the 
invention, therefore, is all such modifications as may come within the scope and spirit of 
the following claims and equivalents thereto. Therefore, the specification and drawings 
are to be regarded in an illustrative rather than a restrictive sense. 

25 
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What is claimed is: 

1 . A method comprising: 

receiving video into a video display device; 

storing, by at least one processor, the video into a memory, upon determining that 
the video display device is in a storage mode; and 
5 performing enhanced image processing on the video with the at least one 

processor, upon detennining that the video display device is in an image processing mode. 

2. The method of claim 1, further comprising compressing, by the at least one 
processor, the video prior to storing the video into the memory, upon determining that the 
video display device is within the storage mode. 

10 3. The method of claim 2, wherein compressing, by the at least one processor, the 
video comprises performing fiame reduction on the video, by a first processor of the at 
least one processor. 

4. The method of claim 3, wherein compressing, by the at least one processor, the 
video comprises scaling, by a second processor of the at least one processor, of the video. 

IS S. The method of claim 1, wherein performing enhanced image processing on the 
video comprises performing a ghost reduction operation. 

6. The method of claim 1, wherein performing enhanced image processing on the 
video comprises performing a noise reduction operation. 

7. The method of claim 1, wherein performing enhanced image processing on the 
20 video with the at least one processor comprises performing a first enhanced image 

processing operation on the video with a first processor of the at least one processor and 
performing a second enhanced image processing operation on the video with a second 
processor of the at least one processor. 

37 
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8. The method of claim 7, further conq>rising transmitting output of the first enhanced 
image processing operation firom the first processor to the second processor through a 
logical connection that includes a path trough a third processor, wherein the third 
processor is not configured to perform a process operation on the output as part of the 

5 transmitting of the output through the logical connection. 

9. An apparatus comprising: 

an input/output interface coupled to a memory; and 

a first programmable processor to perform a fnrst enhanced image processing 
operation on video if a current mode of the apparatus is an image processing mode, and 
10 wherein the first programmable processor is to store the video into the memory if the 
current mode of the apparatus is a storage mode. 

10. The apparatus of claim 9, fiirther comprising a second programmable processor to 
perform a second enhanced image processing operation on the video if ttie current mode of 
the apparatus is the image processing mode, and wherein the second processor is to 

15 perform a video compression operation to the video prior to storage of the video into the 
memory if the current mode of flie jqpparatus is a storage mode. 

11. The apparatus of claim 10, wherein the video compression operation is selected 
firom a group consisting of an hnage scale operation and a frame rate reduction operation. 

12. The apparatus of claim 10, further comprising a third programmable processor to 
20 perform a third enhanced image processing operation on the video if the current mode of 

the apparatus is the image processing mode, and wherein the third processor is to perform 
a noise reduction operation on the video prior to the video compression operation if the 
current mode of the apparatus is the storage mode. 

13. The apparatus of claim 9, further comprising a host processor to receive control 

25 input firom a user of a video display device that includes the apparatus, wherein the control 
input is to cause a change in the current mode, wherein tiie host processor is to update 
microcode in the first programmable processor if the control input is to cause a change 
fix)m the current mode. 
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14. An apparatus comprising: 
a memory 

a first programmable processor to perfonn a first enhanced image processing 
operation on video if a current mode of the apparatus is an image processing mode, and 
5 wherein the first programmable processor is to perform a compression operation on the 
video if the current mode of the apparatus is a storage mode; and 

a second programmable processor to receive output firom the first enhanced image 
processing operation through a first logical connection that includes a path through a third 
programmable processor and to perform a second enhanced image processing operation on 
10 the video if the currOTt mode of the apparatus is the image processing mode, and wherein 
the second programmable processor is to receive output firom the compression operation 
through a second logical connection that includes a path through a fourth progranmiable 
processor and to store the video into the memory if &e current mode of the apparatus is 
the storage mode. 

15 15. The apparatus of claim 14, fiirther comprising a host processor to receive control 
input from a user of a video display device that includes the apparatus, wherein the control 
input is to cause a change in the current mode, wherein the host processor is to create the 
first logical connection if the current mode of the apparatus is the storage mode, tiie host 
processor to create the second logical connection if the current mode of the apparatus is an 

20 image processing mode. 

16. The apparatus of claun 15, wherein the host processor is to update microcode in 
the first programmable processor and the second programmable processor if the control 
input is to cause a change from the current mode. 

17. The apparatus of claim 16, wherein the control input to cause the current mode to 
25 be the storage mode is a pause operation of display of the video on a display of the video 

display device. 
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18. The apparatus of claim 16, wherein fhe control input to cause the current mode to 
be the storage mode is a rewind operation of display of the video on a display of the video 
display device. 

19. A system comprising: 

5 a video tuner to receive a signal, wherein the video tuner is to extract an analog 

video signal at a frequency from the signal; 

a video signal processor to receive the analog video signal and to convert the 
analog video signal to a digital video signal; 

a double data rate (DDR) random access memory (RAM); and 
10 a programmable processing unit having at least one programmable processor and a 

host processor, wherein the host processor is to configure the at least one programmable 
processor to compress and store the digital video signal in the DDR RAM if the system is 
in a storage mode, and wherein the host processor is to configure the at least one 
prognunmable processor to perform at least one image processing operation if the system 
15 is in an image processing mode. 

20. The system of claim 1 9, wherein the at least one programmable processor is to 
perform the at least one image processing operation across multiple firames of the digital 
video signal. 

21 . The system of claim 19, wherein a first programmable processor of tiie at least one 
20 programmable processor is to store results of a first image processing operation of the at 

least one image processing operation in the DDR RAM and wherein a second 
programmable processor of the at least one programmable processor is to retrieve the 
results from the DDR RAM and is to perform a second image processing operation of the 
at least one image processing operation. 

25 22. The system of claim 19, wherein the at least one image processing operation is 

selected from a group consisting of a ghost reduction operation, a temporal noise reduction 
operation and a spatial noise reduction opemtion. 
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23. A machine-readable medium tiiat provides instructions, which when executed by a 
machine, cause said machine to perform operations comprising: 

receiving video into a video display device; 

storing, by at least one processor, the video into a memory, upon determining that 
5 the video display device is in a storage mode; and 

performing enhanced image processing on the video with the at least one 
processor, upon determining that the video display device is in an image processing mode. 

24. The machine-readable medium of claim 23, further comprising compressing, by 
the at least one processor, the video prior to storing the video into the memory, upon 

10 determining that ttie video display device is within the storage mode. 

25. The machine-readable medium of claim 24, wherein compressing, by the at least 
one processor, the video comprises performing frame reduction on the video, by a first 
processor of the at least one processor. 

26. The machine-readable medium of claim 25, wherein compressing, by the at least 
15 one processor, the video comprises scaling, by a second processor of the at least one 

processor, of the video. 

27. The machine-readable medium of claim 23, wherein performing enhanced image 
processing on the video con^rises performing a ghost reduction operation. 

28. The machine-readable medium of claim 23, wherein performing enhanced image 
20 processing on the video comprises performing a noise reduction operation. 

29. The machine-readable medium of claim 23, wherein performing enhanced hnage 
processing on the video with the at least one processor comprises performing a first 
enhanced image processing operation on the video with a first processor of the at least one 
processor and performing a second enhanced image processing operation on the video 

25 with a second processor of the at least one processor 
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30. The machine-readable medium of claim 29, furflier comprising transmitting output 
of the first enhanced image processing operation from the first processor to the second 
processor through a logical connection that includes a path through a third processor, 
wherein the third processor is not configured to perform a process operation on the output 
5 as part of the transmitting of the output through the logical connection. 
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